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Summary, flic j>olyniera.se chain reaction (PCR) is a method 
for flu- selective ainplilieaiii.it nl |)NA ,ir UNA .segments of 
up to 2 kilohasepairs (kh) ur inure in lcii|;ih. "svuilictic 
oligonucleotides Hanking sequences of interest aa- used in 
repeated cycles <if cu/vmatic primVr extension in opposite and 
overlapping directions. The essential steps in each cycle are 
thermal denaiuration of douhle-stranded target molecules, 
primer annealing to both strands ami en/.vmatic svnthesis of 
ONA. Hie use ol the heat-stable l)NA |*olvmerasc from the 
arehebrteiermm Thcrmux ntfttttiims (7« 7 polymerase) makes 
the reaegm amenable to automation. Since both strands of a 
given I^A segment are used as templates, the number of 
target sapiences increases cx|xmcntially. Hie icaciton is 
simple, gst and extremely sensitive. Hie DNA or UNA con- 
tent of :p|iiigJc cell is siifricicni to delect a specific seipience. 
This meMiHl greatly facilitates the diagnosis of mutations „r 
*equcnc^>olynmrphisms ol various iv|kmii human genetics. 
:md the Election of pathogenic components and conditions in 
the context of clinical rcsean li and diagnostics: it is also useful 
in simpg^ing complex analytical or synthetic protocols in 
basic macular biology. This article describes the principles 
of the reaction and discuvses the applications in dillerent areas 
of biomtyjeal research. 



Introduction 

*I1ic extent to which genetic piopetlics or gene activities can 
be studied in molecular terms c'lcpcnds critically on the avail, 
ability of DNA or RNA molecules in numbers of copies large 
enough to warrant analysis by current nicih.uK. To give an 
example: although the cloning of genes involves the mautpul.v 
lion of single molecules, their detection requires subsequent 
amplification in appropriate hosts. We can estimate that, re- 
gardless of the molecular analysis applied, between Ml* and 
HI" DNA or UNA molecules (or gene copies) must be avail- 
able for a single analytical test. Only a few specialized pro- 
tocols, such as in situ hybridization, exist that allow the iden- 
tification of single copy genes in single chromosomes. Obvi- 
ously, any increase in physical sensitivity that could improve 
the resolution by lowering the detection threshold would pro- 
hice otherwise unavailable information. Quite naturally, 
strenuous efforts have been made over the years and are still 
being pursued, to enhance the resolving power of existing 
methods or to design new techniques with the goal of detect- 
ing smallet mimlvis of molecules. 
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A different approach for the investigation of nucleic acid 
sequences has been designed by Saiki et al. ( I W5). These au- 
thors have invented the method of the polymerase chain reac- 
tion (PCR). which is not based on increased detection sen- 
sitivity but. instead, on the expansion of the number of target 
sequences, which are then subject to a conventional analysis. 

An important aspect of the PCR is that selective amplifica- 
tion of a .sequence ol interest reduces, al the same time, the 
background of sequences that are not wanted. This condition 
facilitates not only sequence detection, but also preparative 
manipulations of amplified ONA. liy eliminating the need for 
extensive purification, the PCR minimizes the time and labor 
needed for handling nucleic acids. It may adequately be qual- 
ified as a form of "cell-free molecular cloning- (A.Wilson, 
quoted in .Saiki et al. PJXNa). 
, It is foreseeable that the PCR will enhance the power of 
diagnostic activities that depend on the analysis of DNA or 
RNA sequences: prenatal diagnosis of inherited disorders, ge- 
netic counseling, clinical disease diagnosis, forensic investiga- 
tions and related topics. In addition, this new method turns 
(tut to Ik- very useful in a number of basic research applica- 
tions in molecular biology and genetics. 

'Hie distinct advantages and limitations of this new method 
are described ami discussed in this review article. Typical 
applications of the PCR. which are already in use or which can 
be anticipated in human genetics and in other areas of bio- 
medical research will be considered. 

Rarely has a new technique in molecular biology and ge- 
netics been so successful within such a short time. New ami 
interesting applications are presently being published at a fast 
rate. Because of the speed of this development, it is not my in- 
tention to give a complete description of all conceivable uses 
of PCR. In addiiii.it. detailed experimental protocols are not 
included. Instead, the reader is referred to Mullis and Faloona 
(I9«7) and to Saiki ct al. (l9KSa). The term "amplification- 
has various connotations, even in the context of molecular ge- 
netics. In this ankle, it is used to describe the process and 
result of the PCR. 



The technique of the PCR 

The logic of the reaction is simple in principle. It depends on 
the annealing of oligonucleotides to homologous DNA or 
RNA sequences and on enzymatic DNA synthesis in vitro 
primed by these oligonucleotides. A pair of primers com- 
plementary to In. (I, strands of a ONA molecule and flanking a 
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Hj^I. A simplified scheme of the polymerase chain icactiou (I'C'U) 
wifgjdmihlc-stranded DNA as tcmplaic. *Ilie two primers anil // 
•serve in initiate ONA synthesis in a itcfineit rcpoti. 'Ilie armwx iiuli- 
caUMhe direction of synthesis in each cycle, lisidt cycle consists ol 
lerffjgatc denam ration hy heat, primer annealing and en/ymaiie syn- 
thesis. DNA products of discrete length (defined hy the distance l»e- 
(hi* y ends of the primers; we the urrawhtuuls in the M/» mi# 
line\ f :\w generated Iroui the iluitl cwle on. < >nlv iIicm* molecules ate 
am^ticd e\|Hmenli:illy. Ilie v cuds ol ONA sttands iim d in the Inst 
two^ryelcs are indicated lo facilitate oiientaiion. 'I "he hneai meicasc 
of 4nuUvnlcs wnli heterogeneous leiiftih is diMc-p-udcd heie (« 
mimlw of cycles) 
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Hr.2A,H. Demonstration of the exponential increase in the amount 
of amplified ONA. A I07-hp*Iong HIV I (hum:in immunodeficiency 
virus |) specific DNA fragment was amplilicil hy |»olymcrase 
primed hy two 20mers in the presence of ti|***P|d(Tl' (in .Mini assays 
essentially as inSaiki et al. l'JKX). Mve sep;irate reactions were slopped 
after 7, 10. 15; 17 :iud 20 cycles. Alitpiois (5ul) were acid-prccipiiaicd 
(A) and separated on a K% jmlyacrylamidc gel (see the auior:ulio- 
giaphic analysis in It; the 7«%*yele reaction was omiitctl here). The 
elltciency pei cycle in this cxpci intent was aKwi 50%. M Si/.e marker. 
(Courtesy of M. riordi) 



target region of interest is used lo, direct ONA synthesis in 
repeated cycles in opposite and overlapping directions. In 
each cycle, boih strands are templates for the gen era I ion of 
two new duplex molecules, 'litis leads (theoretically) to a 
doubling of the number of target sentiences in each round of 
synthesis. Thus, the overall increase in this number is expo- 
nential. 

The general course of the read ion with DNA as the initial 
template is outlined in Fig. I. Liach cycle is initialed by melting 
double-stranded DNA at *M"-V5"C (usually for I miu) to ob- 
tain single-stranded templates. 'Iliis step is followed first by 
annealing of the primer oligonucleotides, which are added in 
large molar excess over template strands, ami then by a brief 
pulse of DNA synthesis (normally between 2 and 5 min). The 
primers are al the beginning of (he reaction in I if- fold to 10'-- 
fold stoichiometric excess, depending on the original concen- 
tration of target sequences. The temperatures applied for 
primer annealing (between 5(P and 5.V<\ occasionally below 
5t)"C') and DNA synthesis vary with the cn/.ymcs used (e.g.. 



between 5K* anil 72*V for the hcat-resistenl Taq polymerase) 
and depend also to some degree on the base com|x>sition of 
the primers. "Ilie lower the G I C-couicni, the lower the opti- 
mal temperature for the reaction (Kim and Smithies 19SK). 

Figure 2 depicts the exponential course of the amplifica- 
tion with the DNA polymerase of Thrrmux <tt[ttnticn\ and a 
HIV I DNA sequence as target. *l*his reaction was followed by 
the incorporation of radio-actively labeled nucleotides and 
monitored by autoradiographic product analysts. 

DNA or UNA as a template may be isolated from any bio- 
logical source. It can be obtained from cells (Kawasaki et al. 
IV8X; Kim and Smithies 19XK). hair roots (Higuchi et al. 
IVKKa). sperm (l.i el al. 1VKK) or surgical biopsy tissue samples 
(W. Mommaerts. personal communication), liven DNA ex- 
tracted from embedded archival tissues (Impraim et al. 1987; 
Smit cl al. IWX; l.ai-Goldmau ct al. I°K.S) or from specimens 
of extinct animal species (Taaho and Wilson IVKK) is suitable 
for the amplification reaction. 

lite amplification of UNA sequences (Fig. 3) is preceded 
hy a revet se iransciiplion step, resulting in the generation of a 
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Cyclic •■plUf cation 

Mg.,3. Scheme for amplification with mKNA as initial template. The 
first siep primed by oligonucleotide A is the *yinhc%is tif single- 
siranded cONA by reverse transcriptase. The conversion into a 
ck>uble stranded DNA molecule is achieved in a second step with a 
DNA-itependent DNA polymerase (Taq or Klenow) primed by 
oligonucleotide B. The reaction proceeds in cydes as shown in Fig. I. 
Duubk-siranded DNA products of di.«:rcic length appear from Hie 
third cycle on 



single-stranded cDNA complementary to (he original RNA 
(rnRNA, viral RNA, etc.). This cDNA is then in a second step 
convened inio double-stranded DNA by the action of the 
am pWy»ng DNA polymerase. 

^ffe*^ 01 aproaches arc available for the reverse transcrip- 
tion S m RN As. _cDN A may be obtained cither by oligo(dT) 
prirnTtib (s*-'c Todd ct al. 1987), by random hexamere (Veres el 
al. I|g7; Noonanand Roninson 1988). or directly by gene spe- 
cific gfe'gonuclcotides (Harbarth and Vosbcrg 1988). The two 
formg procedures have the strategic advantage that more 
than&ae rnRNA can be identified using the reverse transcripts 
prese^alf in a single assay mixture. 

TTre reaction with RNA sequences may be complicated by 
a corg^rrcnt amplification of contaminating DNA sequences. 
Bcca&se of the extraordinary sensitivity of the method, small 
num&brs of DNA molecules, otherwise undetectable, could 
contriSute to the products of the reaction. To avoid such com- 
p1ica«]o>i, DNA can be excluded in a number of ways. ONase 
(if fr^ejof RNase) may digest DNA selectively. It is also pos- 
sible ugrcmavc DNA as an amplification target by restricting 
it between the primer annealing sites (Harbanh and Vosbcrg 
1988). If rnRNA from higher cukaryotcs is to be amplified, an 
obvious measure is the selection of primers that recognize 
separate exons. In this case, DNA- and RNA-dcpcndent pro- 
ducts can be distinguished by their different lengths: products 
derived from DNA include intron sequences and are therefore 
longer than those derived from RNA. 

Occasionally, template sequences may be refractory to 
amplification due to stable intramolecular secondary structure 
within template strands, e.g., if G+Ccontents of target 
regions are high. This complication can be overcome by using 
the nucleotide analogue 7-deaza-dGTP instead of dGTP (or in 
addition to dGTP) as a precursor for DNA synthesis. The 
analogue destabilizes intrastrand folding without impairing 
Watson-Crick base pairing between strands (McConlocuc ct 
al. 1988). 

For the synthetic step in the cycle, a number of different 
DNA polymerases can be applied. The original protocol 
(Saiki ct al. 1985) made use of the Klenow fragment of E.coli 
DNA polymerase I. The unmodified DNA polymerase I, the 
DNA polymerase of the phage T4 or the modified T7 DNA 
polymerase can also be applied (Tcynor-MacLachlan 1988; 



Kcohavong ct al. 1988a, b). A critical disadvantage of these 
" enzymes. is their heat lability. Since they do not survive the 
DNA denaturation temperature <9I*-95'C). fresh samples 
have to be added in each cycle. 

The most frequently used enzyme is now the heat-stable 
DNA polymerase of the archebacterium Thermos aquaticus 
designated Taq polymerase (Chicn ct al. 1976). This enzyme 
survives even extended incubation at 95 # C (Saiki et al. 1988a). 
It offers a number of advantages: first, it docs not have to be 
added in each cycle (essentially, with a good enzyme prepara- 
tion, the addition of one unit is sufficient for the entire 
amplification running through 30 or more cycles). Secondly, 
by allowing synthesis at elevated temperature, it reduces the 
chances of unintended oligonucleotide priming by destabiliz- 
ing mismatch-pairing with unwanted target sequences, as may 
result from partial homology with random or related, but not 
identical sequences. This is particularly important if genes or 
transcripts, which originate from multigene families, arc 
amplified. Mismatch-priming is less likely to occur at higher 
than at lower annealing and/or polymerization temperatures 
(Saiki ct al. 1988a). Thirdly, the availability of the heat-stable 
Taq polymerase was a critical prerequisite for the develop- 
ment of automatic equipment for PCRs (see below). 

The PCR is efficient, specific and very sensitive. Regard- 
ing efficiency, the theoretical upper limit of the number of 
product molecules is 2", where n is the number of cycles. This 
means that every target sequence present at the beginning 
could, in 20 cycles, give rise to about a million progeny 
molecules. Under normal experimental conditions, this value 
is not obtained, however. A more realistic average efficiency 
of 85% per cycle (Saiki et al. 1985) reduces the overall yield to 
a value of about 2.2 X 10 5 in 20 cycles (= I.85 2 "). The expo- 
nential increase in the number of product molecules is not un- 
limited. One example of conditions contributing to a gradual 
decrease in the efficiency with increasing cycle number is the 
increasing amount of template molecules that have to be ac- 
cepted by an enzyme that at the same time loses some of its 
activity because of repeated heating. If heat-labile DNA-poJy- 
mcrases are used, the need for adding new enzyme to each 
cycle leads to a gradual change in the assay composition; this 
may affect the catalytic activity of the enzyme. 

An attractive feature is that in many cases target sequences 
do not have to be purified extensively prior to amplification. 
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Fig. 4. Amplification with nested sets of primen. To increase the 
specificity of products obtained from complex template mixtures, the 
reaction can be started on template (T) with the primers A and 5. 
They define the length of the intermediate product (IF). In a sub- 
sequent reaction, this is the template for the primers Cand D, which 
arc located in the region between primers A and B. The length of the 
final product (FP) is determined by Cand D. Altogether four primers 
are involved in template selection. Modified versions of this scheme, 
e.g., with three primers, arc conceivable 
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Identification of c I 
products oo gmim 

Ki*.5. Oligomer restriction for the identification of amplified DNA. 
The amplified region includes a. cleavage site for a restriction endo- 
nuclcusc (RE). PCR products ons denatured and annealed to a 5* end 
labeled oligonucleotide, which did not serve as primer. After cleavage 
of the duplex with the appropriate restriction enzyme, the marker 
fragment carrying the label is identified by gel electrophoresis 



TrgOmcthod works on genomic DNA as a whole or on crude 
mifqjjires of total cellular RNA (Saiki et al. 1988a; Harbanh 
□ntJLVosbcrg 1988). Serum samples can be taken directly as a 
source of infectious (parasitic) DNA or RNA (Larzul et al. 
1988; R. Scclig. personal communication). 

Jgowever. uncontrolled biochemical sample compositions 
an|Qi high degree of DNA or RNA complexity have the disad- 
vantage of reducing reaction specificity (and also efficiency). 
Random primer-target interactions cannot be excluded. Mea- 
sures to enhance specificity are an increase (as already men- 
tioned) in the annealing temperature, the use of nested sets of 
primers, or partial fractionation of crude preparations. Nested 
setsjof primers (Fig. 4) involve one round of repeated syn- 
thesis with one set of primers and a second round with primer 
secprnces located between the primers used in the previous 
roirf d (Engelke et al. 1988; Haqqi et al. 1988). This will essen- 
tially homogenize the sequence of interest if unwanted 
background amplifications pose a problem. The use of rapid 
size fractionation to gain specificity in the amplification of 
target sequences in complex template mixtures has been re- 
ported recently (Beck and Ho 1988). 

The amplified products are verified by a number of 
criteria. The first and most frequently used criterion is the 
length of the product visualized in ethidium bromide stained 
agarose or polyacryiamide gels. In many cases the length of an 
amplified fragment can be anticipated from the known posi- 
tions of primer annealing sites on known target sequences. 
Unpredictable lengths may occur in rare cases with introns of 
unknown composition or if only one of the ends of a DNA or 
RNA target segment is known. The amplification will then 
yield new information, which is in itself a useful application of 
the procedure. Further criteria are predictable restriction 
sites, which arc monitored either directly (Deng 1988; Har- 
barth and Vosberg 1988) or by a procedure called oligomer 
restriction (Kwok et al. 1987). The tatter method uses radio- 
actively 5'-labctcd oligonucleotides. They arc annealed to the 
denatured amplification products and give rise, after cleavage 



with a defined restriction endo nuclease, to labeled fragments 
of distinct length, a particularly fast* albeit indirect, procedure 
to identify an amplified product (sec Fig. 5). 

Alternatively, the products can also be blotted on filters 
according to Southern (1975) or dotted directly and hybridized 
with labeled diagnostic oligonucleotides* which anneal to 
sequences between the primer sites (Saiki et al. 1985; Har- 
banh and Vosberg 1988). Ultimately, the amplified fragments 
can be sequenced directly, occasionally even without further 
purification of the amplified fragments (Saiki et al. 1985: 
Engelke et al. 1988; Kcohavong ct al. 1988b; Vigilant ct al. 
1988). A modified PCR protocol leading to an accumulation 
of single-stranded DNA in the assay mixture greatly facilitates 
sequencing (Gyllcnstcn and Erlich 1988; Innisct al. 1988); for 
.details see the section "Applications in basic molecular bio- 
logy.** Direct sequencing is indicated as the standard proce- 
dure for the verification of. for instance, polymorphic HLA 
haplotypes or of genomic mutations (Todd ct al. I9K7; Schnrf 
ct al. 1988: Wong et al. 1987; Simpson et al. 1988). 

It is still undetermined where the length-limits arc in the 
amplification of target sequences. Encouraging data wore ob- 
tained with purified cloned DNA, which could be amplified 
up to a length of over 2kb (Saiki ct al. 1988a: Kim and 
Smithies 1988). With genomic DNA a length of 2 kb has been 
successfully amplified (Keohavong ct al. 1988b). Reportedly, 
longer products can be obtained. We have achieved a 1.7-kb- 
long myosin heavy chain cDNA fragment, starting with total 
RNA from muscle tissue (M. Pfordl and K. W. Dicdcrich. un- 
published observations). The Taq polymerase may be better 
suited than the Klenow enzyme for the amplification of longer 
sequences (Saiki ct al. 1988a) since its catalytic action is highly 
processive (Innis ct al. 1988) and difficulties due to intra- 
molecular DNA secondary structure are tess likely to occur at 
temperatures optimal for the archebactcrial enzyme (60*- 
7<TC). 

A very important feature of the PCR is its high sensitivity. 
Saiki et al. (1988a) reported that a 10~ 6 dilution of genomic 
DNA containing the p-globin gene into genomic DNA with a 
homozygous deletion of this gene still allowed amplification of 
a p-globin target sequence in a reaction over 60 cycles. This 
result suggests that a target sequence, which is present only 
once in Mr to 10 6 cells, can be detected by amplification. Con- 
sequently, single isolated cells or single sperm are suitable for 
the detection of genomic target sequences (Kim and Smithies 
1988; Li et al. 1988). A relatively high sensitivity has also been 
reported for the detectability of mRNA sequences. Thus, with 
a conservative estimate of 30000 template molecules (prob- 
ably less) in un fractionated total RNA preparations from 
muscle tissue, a p-myosin heavy chain gene fragment was 
amplified to autoradiographic visibility in 20 cycles with the 
Klenow enzyme (Harbarth and Vosberg 1988). In another 
study on muscle-related gene expression, it was observed that 
a single muscle fiber from an avian skeletal muscle is sufficient 
for the detection of myosin heavy chain message (B.Kirsch- 
baum and D.Pette, personal communication). A comparably 
high sensitivity was recently demonstrated in experiments 
showing thatlhe dystrophin mRNA does not normally appear 
only in muscle, but possibly also in minute quantities in non- 
muscle tissues and cells (Chelly et al. 1988). Whether tissue 
cross-contamination contributed to this unexpected result re- 
mains to be seen. On a cellular level, it has been shown that 
the RNA content of a single cell is sufficient for sequence spe- 
cific amplification (Rappolec et al. 1988). 




The dystrophin mRNA analysis has, moreover, a bearing 
on another relevant aspect: the PCR can be used to compare 
mRNA contents of different cells or tissues in at least 
semiquantitative terms. The principle of such a comparison 
relies on co-amplification off two mRNAs, the relative content 
of one of which is known from independent analysis. In this 
study, the mRNA coding for aldolase A was used as an inter- 
nal standard. Alternatively, one could add as a reference a 
known number of copies of in vitro synthesized cRNA molec- 
ules with a sequence similar to that of the tested mRNA. 

Since in vitro DNA synthesis is, by its very nature* an 
error-prone process, sequence fidelity of amplification pro- 
ducts is a point of major concern. A number of reports have 
addressed this question in detail (Scharf ct al. 1986; Saiki et al. 
1988a; Dunning ct al. 1988; Paabo and Wilson 1988). The 
most extensive assessments have been performed by cloning 
and sequencing individual amplified fragments derived from 
regions of the human HLA-DP|i gene (Saiki el al. 1988a) and 
of the human apolipoprotein 0 gene (Dunning ct al. 1988). In 
28 DNA HLA DOP-cloncs, each 239 bp long and inserted 
into a M13-vcctor and all derived from a single individual, no 
deletions or insertions were found, although 17 misincorpo- 
rated bases (mostly transitions) were identified (error fre- 
quency: about 0.25%). Taq polymerase was used in this ex- 
periment, which went through 30 cycles. A similar error fre- 
quency was obtained in the study with the apolipoprotein B 
gcjrt fragments. 

flThe number of misincorporatcd nucleotides depends on 
tw^ojfactors, viz., on the rate of misincorpo ration during syn- 
tn §9 s amJ on ,ny number of "generations", i.e., the number of 
i $y S lclic c y c,cs * Tnis number contributes to the error fre- 
qugpey at the end of the overall reaction, since misincorpor^- 
tionp occurring in an early "generation" arc inflated in number 
in%ach subsequent cycle of doubling. The rate of misincorpo- 
ra l*P n (m) can be determined by the formula m « 2f/c (Hayes 
where /is the frequency of misincorporatcd bases veri- 
fied] by sequencing and c is the number of cycles. For Taq 
pqgmcrasc, the rate was calculated to be about 2 X 10" 4 
(S|gki et al. 1988a). A slightly lower value for this enzyme 
(l.jjx 10" 4 ) was recently reported from a study involving in 
vitjo primer extension synthesis by this enzyme (without 
amplification) and subsequent genetic screening in vivo for 
single base substitutions (Tindall and Kunkel 1988). 

Preliminary studies with products amplified by the Klenow 
enzyme suggest a rate of about 8 x 10" 5 for this enzyme (Saiki 
ct al. 1988a; Oste 1988). Thus, the Klenow enzyme appears to 
be somewhat more reliable in preserving DNA sequences dur- 
ing the process of amplification. The observed rate is, how- 
ever, relatively high compared with data obtained under con- 
ditions of only one round of DNA synthesis (Tindall and 
Kunkel 1988). In the latter study, the Klenow enzyme had a 4- 
to 8-fold lower rate of errors than the Taq polymerase. 

The two enzymes differ in their ability to catalyze 3'-5' 
cxonuclease proofreading. While the Klenow enzyme is en- 
dowed with this ability, the Taq polymerase is not (Chien et 
al. 1976; Tindall and Kunkel 1988). This difference, possibly 
together with the mutagenic effects of high reaction tempera- 
tures (see Drake and Baltz 1976), may at least partially ex- 
plain the relatively high rate of misincorpora lions by the Taq 
polymerase. 

This complication is for most purposes of no consequence 
for the PCR. Analytical procedures such as direct sequencing 
or filter hybridization with allelc-specific oligonucleotides do 



not suffer from the small number of errors in the amplified 
products (up to I in 400 bp are wrong, with a random distribu- 
tion). However, if cloning of individual fragments is required, 
sequences need confirmation by analyzing independent iso- 
lates, in particular if only a few copies of the target sequences 
were initially available. It therefore seems advisable that ex- 
periments involving cloning of amplified DNA should be car- 
ried out with a large rather than a small number of template 
molecules, and amplification should take place in as many 
cycles as arc necessary, but not more. Although the Taq poly- 
merase is generally preferable because of its heat stability, the 
less convenient Klenow enzyme or the phage T4 or T7 DNA 
polymerases with their higher fidelity rate (Tindall and Kun- 
kel 1988; Kuchta ct al. 1988) may in exceptional cases still be 
useful. 

It has already been mentioned that the heat stability of the 
Taq polymerase makes reactions with this enzyme amenable 
to automation. Since in many cases one or at most two addi- 
tions of enzyme are sufficient to maintain a high rate of syn- 
thesis, equipment is needed that is able to regulate tempera- 
ture changes automatically. A number of relatively expensive 
machines are available, but cheap and simple laboratory solu- 
tions have also been proposed (Rollo ct al. 1988; Foulkes et 
al. 1988; Kim and Smithies 1988). We have constructed an 
inexpensive computcr-controlcd mini robot (a portal robot), 
which acts by transferring the incubation vials in a cyclic fash- 
ion from one glycerol bath (for DNA synthesis) to a second 
(for denaturation) and then to a third one (for primer anneal- 
ing). Each bath has a different preset temperature. The vials 
are for each incubation completely Submerged by the robot 
lever. Thus, the temperature transition between the reaction 
steps is fast, and losses of volume due to evaporation are 
avoided (M. Pfordt and K. W. Diederich, unpublished re- 
sults). 

The inherent high sensitivity of the PCR requires a special 
comment regarding experimental care, which may affect the 
interpretation of results. Minute contaminations of benches or 
frequently used laboratory equipment with template molec- 
ules from various sources, including previous amplifications, 
may lead to the inadvertent addition of target sequences 
in assay mixtures and, hence, to false-positive "signals" 
(Simpson ct al. 1988; Lo ct al. 1988b; Kim and Smithies 1988). 
In the course of our experiments, this problem arose occasion- 
ally with a certain time lag after the introduction of a new set 
of primers. This complication, which may be regarded as an 
unwanted demonstration of the power of the polymerase 
chain reaction, requires extreme care in the handling of all 
reaction components and regular quality checks in the form of 
negative control amplifications, i.e., reactions without added 
template. Experimental conditions, such as sterile working 
habits, may even be advisable. 



Applications of the polymerase chain reaction 

Genetic and re fa ted research 

Essentially, three major areas of biomedical research will 
benefit from PCR. These are human genetics, including ge- 
netic services and certain forensic applications, clinical investi- 
gations with the goal of monitoring the causes of disease, the 
progression of disease and therapeutic success and, finally, 
basic molecular biology. Tabic 1 depicts a list of applications 



Table 1. Applications of the polymerase chain reaction 

/. Generic research and counselling 

Detection of mutations 

Prenatal diagnosis of inhcritcU(t)isordcrs 

Prenatal sex determination 

Carrier detection in families and populations 

RFLP linkage studies 

Generation of probes for gene mapping and in situ hybridization 

Population genetics 

Forensic identification of individuals 

2. Clinical investigations 

Pathogen detection and typing 

Identification of activated oncogenes and tumor typing 

Monitoring of disease progression and therapy 

Disease susceptibility studies and preclinical risk assessment 

- -J* Applications in molecular biology 
DNA sequencing 

Ccnc synthesis and gene modification 
Gene expression studies 
Gene targeting 

Site directed mutagenesis \ 



that have already been reported in the literature or which will 
presumably be realized soon. 

rgie usefulness of the PCR for genetic research was docu- 
mented in the first PCR report, in which probing of the sicklc- 
celHrputotion in amplified DNA fragments was demonstrated 
(Sai£i ct d). 1985). Soon thereafter, it was shown that direct 
sccpS ncing of amplified mutant alleles is feasible. This study 
inched the identification of previously unknown mutations 
of jt£ P-globin gene (Wong et al. 1987). * 

¥or the routine analysis of mutants, allctc-spccific oligo- 
nucleotides (ASO) (Connor et al. 1983) can he used in a 
simple "dot-blot" filter hybridization assay that affords a rapid 
distinction of homozygous and heterozygous constellations 
(segTig. 6, taken from Saiki et al. 1986). The detection probes 
were radioactive I y labeled in most cases. However, the appli- 
* cation of non-radioactive biotinylated or enzyme-labeled 
proves has also been reported (Bugawan et al. 1988; Saiki et 
aj. J988b; SyvSncn ct al. 1988; Lo ct al. 1988a). Isolation and 
labeling of DNA probes can be achieved by PCR in a single 
preparative step (Liang and Johnson 1988; Lo et al. 1988a). 
Alternative methods for the identification of amplified globin 
alleles are the oligomer restriction technique (Embury et al. 

1987) or direct restriction mapping of amplification products 
(Chchabct al. 1987; Kulozik ct al. 1988). 

The application of the PCR for prenatal diagnosis and 
mutation analysts of globin-rclatcd inherited disorders has fre- 
quently been described (Chchab et al. 1987; Embury et al. 
1987; Cai et al. 1988; Kulozik ct al. 1988; Bugawan et al. 1988; 
Chan ct al. 1988; Saiki et al. 1988b). Fetal DNA was prepared 
cither from amniocytes or from chorionic villi. An interesting 
prospect is the use of sets of appropriately chosen oligonucleo- 
tides, which allows the rapid identification of multiple muta- 
tions within a given population or geographic area (Cai et al. 
1988; Diaz-Chico ct al. 1988). 

Another genetic trait studied by DNA amplification is the 
a r antitrypstn deficiency. Adult individuals carrying the MM, 
MZ, or ZZ alleles (Bmun Petersen et al. 1988; Newton et al. 

1988) , or fetuses at risk of a r antitrypsin deficiency (Abbott et 
al. 1988) could readily be identified by appropriate ASOs. In 
the latter case, chorionic villus DNA was amplified and diag- 
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Flg.6. Example of an analysis of homo- and heterozygosity in genomic 
DNA by allele specific oligonucleotides (ASO). The ASO probes 
were designed to recognize the hemoglobin allele C (I9C). the sickle 
cell allele S (I9S) and the normal p-globin allele (I9A). Portions of 
genomic DNA (lug) with known p-glohin genotype were amplified 
with the Klenow enzyme. Aliquots of PCR product* were denatured 
and applied to a nylon filter. The ASO probes were labeled at their 5* 
ends with |"P|. After hybridization, the filters were washed for lOmin 
fit 55X (for 19C) or at HTC (for I9S and 19A). The homtv and 
heterozygous genotypes are indicated on the right side. XX is DNA 
from a cell line with a homozygous p-globtn gene deletion. (For de- 
tails see Saiki ct al. 1986; reproduced with permission) 

nostic use was made of a known polymorphic restriction site in 
exon III of the a r antitrypsin genc. In this analysis, probe hy- 
bridization was not required. 

If the disease-causing mutations are known, allelc-specific 
amplification may be used for screening not only sibships car- 
rying certain recessive traits, but even populations. Carriers 
without a family history of the disease could then be detected. 
DiLelle ct al. (1988) have discussed such a measure for the 
identification of PKU carriers and suggested that it is techni- 
cally feasible. This conclusion is corroborated by PCR-based 
genotyping of apolipoprotein E alleles in a cohort of 68 indi- 
viduals (Weisgraber et al. 1988), by studies on the genetic sus- 
ceptibilty to insulin-dependent diabetes mellitus (Gu ct al. 
1988) and also by an investigation of the spectrum of p-thalas- 
semia genes in Spain (Amselem et al. 1988). 

If the mutations leading to on inherited defect arc not 
known, the PCR may provide the missing information by 
amplifying genomic DNA or mRNA sequences (Simpson et 
al. 1988; sec also the globin mutants reported by Wong et al. 
1987). Using the combination of amplification and direct 
sequencing of amplified DNA, a point mutation in the HPRT 
gene HPRT m ««a (which causes gouty arthritis) could be veri- 
fied. The mutant is a C-A transversion, substituting a serine 
by arginine at amino position 103 (Cariello et al. 1988). (This 
substitution was known from protein analysis). In the gene 
coding for the clotting factor VIII, a previously unidentified 
. missense mutation (a G-C transversion) has been detected by 
this procedure (Levinson ct al. 1987). More recently, a muta- 
tion in the factor IX gene was also characterized by PCR 
(Denton et al. 1988). 

A powerful combination for a rapid mutational analysis is 
the use of ribonuclcase A (RNasc A) protection (Myers ct al. 
1985) together with PCR. The mutational site is first localized 
by RNase A protection before it is subsequently amplified 
using the respective genomic DNA or mRNA sequences as 
templates. This protocol avoids the slow process of genomic 
library construction and screening. The method was first used 



„/ Veres ct al. (1987) for a molecular analysis of the murine 
sparse fur mutation resulting from a deficient ornithine trans- 
carbamylase. This mutation serves as a model for the most 
common human urea cycle disorder. A second report based 
on this strategy was conccrrre^ with a dcficicncy in ornithinc- 
£-transcarbamylase. causing gyrate atrophy of the choroid and 
the retina (Mitchell et al. 1988). 

The PCR technique has been applied to the analysis of 
genomic deletions, as they are frequently found in Duchenne/ 
Becker muscular dystrophy patients (Kocnig ct al. 1987). The 
detection of previously unknown deletions by scanning geno- 
mic DNA with selected multiple sets of primers and the pre- 
natal identification of deletions known to exist in hetero- 
zygous carrier mothers will be greatly facilitated by amplifica- 
tion (Chamberlain et al. 1988). The same procedure should 
distinguish the carrier status of daughters of identified carrier 
mothers. 

The diagnosis of a wide variety of inherited disorders is 
based on restriction fragment length polymorphisms (RFLP). 
The presence or absence of a variable cleavage site, tags muta- 
tions in pedigrees. The reliability of predictions depends on 
the distance between the marker locus and the gene of inter- 
est: the tighter the linkage, the higher the probability that a 
prediction is correct (for reviews see Botstcin et al. 1980; 
Gusjcjla 1986). The usual method for the identification of mu- 
tati0fts via linkage analysis of this type requires sufficient 
quffftities of relatively intact genomic DNA (approximately 
1-itFug per assay), cleavage, gel electrophoresis. Southern 
blcfeingand probing RFLP haplotypcs with radioactive DNA. 
ThetPCR offers a convenient alternative to this time-consum- 
ir>g3nd costly protocol by amplifying the regions, which in- 
duxfe the polymorphic sites, from minute amounts of genomic 
DNA (1 ng or less). The presence or absence of a diagnostic* 
site!* shown directly by cleavage of the amplified DNA frag- 
ment. In cases of heterozygosity, both alleles can be identified 
(se^Feldman ct al. 1988). The only prerequisite is knowledge 
of Jjfe DNA sequence surrounding the polymorphic sites. To 
obtain these sequences requires the effect of cloning (mostly 
of ydsmtd size fragments), subcloning and sequencing. So far, 
oni^a few sites have been sequenced, but this number will 
ccrSinly incceasc in the near future. 

The first successful application of this new approach in- 
volved hemophilia A (Kogan et al. 1987). Polymorphic re- 
striction sites from within the factor VIII gene were used. 
DNA for prenatal diagnosis was extracted from chorionic villi. 
The authors claim realistically, as do others (Williams ct at. 
1988), that a diagnostic result is available within a day, given 
the appropriate experience. Similar results have been ob- 
tained with the prenatal diagnosis of cystic fibrosis (Feldman 
et ai. 1988). In these latter cases, linked extragente marker 
sites were used. The genes responsible for these disorders are 
not known. 

A particularly interesting application in a related context is 
the genotyping of single sperm. Each sperm is the product of 
a single meiotic event; hence, many such events can be investi- 
gated with material obtained from one individual. In an intro- 
ductory study, it was shown that separate genetic loci can be 
analyzed simultaneously by DNA amplification (Li et al. 
1988). This application should allow the measurement of re- 
combination over distances that are shorter than those cov- 
ered by pedigree analysis, in particular if recombinational hot 
*pots arc involved. It is conceivable that the accurate ordering 
of tightly linked RFLPs will be greatly facilitated. The use of 



single sperm may pave the way for a new approach of generat- 
ing genetic maps for species that are not available for selective 
breeding.. t 

The PCR has also been used for prenatal sex determina- 
tion by the amplification of Y chromosome specific DNA se- 
quences (Kogan ct al. 1988). 

Other genetic research where the PCR has been employed, 
as an improved method are genetic epidemiology (including 
pharmacogenetics) and population genetics. Interindividual 
differences in the susceptibilities to environmental compo- 
nents (e.g., alcohol, drugs, pollutants of various kinds) will be 
amenable to rapid analysis by DNA amplification as soon as 
the genes involved in the responses to environmental chal- 
lenges have been identified. The feasibility of using this tech- 
nique has already been demonstrated by genotyping human 
class I alcohol dehydrogenase (ADH) alleles. The method 
allows different allelic variants to be distinguished and thus 
provides a means for the determination of the ADH isoenzyme 
pattern of humans (Gennari et al. 1988). This study includes a 
reliability test by comparing the results obtained by ASO hy- 
bridization with isoenzyme variants isolated from liver speci- 
mens. 

In connection with phylogenctic investigations of human 
populations, length mutations as well as conformational muta- 
tions in human mitochondrial DNA have been determined by 
direct sequencing of amplified DNA (Wrischnik et al. 1987; 
Vigilant et al. 1988). Two primers were used for amplification 
and a third for sequencing. Taking this report into considera- 
tion, together with the case both of sampling DN A -containing 
specimens (one hair is sufficient) and of handling large num- 
bers of assays, one may anticipate that molecular studies on hu- 
man populations can be designed on a larger scale than before. 

The forensic identification of individuals relies on the de- 
monstration of interindividual genetic differences. Genotyp- 
ing of people is possible using genomic DNA and probes, 
most notably those derived from minisatellite DNA sequences 
(now called variable number tandem repeats or VNTR), 
which recognize highly variable RFLPs (Jeffreys et al. 1985). 
However, this approach requires more DNA than can fre- 
quently be obtained from relevant biological materials. Single 
hairs have therefore been taken to extract DNA for a PCR 
analysis. A freshly plucked hair yields about 200 ng DNA. a 
• shed hair 10 ng; 1 ng is obtained if the hair is very old (Higuchi 
et al. 1988a). Mitochondrial DNA, which has extensive se- 
quence polymorphism in its D-loop (Aquadro and Grcenberg 
1983), HLA genes (Higuchi et ah 1988a), single VNTR se- 
quences (Jeffreys et al. 1988) and possibly others can be used 
as polymorphic markers. Obviously, collecting hairs for genet- 
ic analysis may be a convenient alternative to collecting sam- 
ples that are either difficult to obtain or delicate to handle, in 
particular if transport over long distances is needed. 

Clinical research applications 

Extensive use of the PCR can be expected in disease-related 
clinical investigations and diagnostics. Since in many common 
diseases, genetic factors (e.g., somatic mutations, inherited 
disease susceptibilities or multifactorial inheritance) are in- 
volved, this research increasingly has cross-connections to 
classical and molecular genetics. The growing trend to extend 
diagnostic efforts to the analysis of DNA or RNA sequences 
underlines the impact that genetic research has on medical re- 
search in general. 



■ ^ Malignant and infectious diseases were among the first 
I . clinically relevant topics for which the advantages of the PCR ' 
| were realized. The ras protc-oncogcncs, which in mammalian 
organisms form a family 6( at least five closely related non- 
allelic isogencs (two being pseudogencs), acquire their trans- 
forming potential by point mutations in the amino-acid codons 
12, 13 or 61 (for review see Marshall 1986). The standard 
assay used so far to detect mutated ras genes is transfection of 
NIH-3T3 cells. This method is too laborious for routine analy- 
sis of DNA from tumor specimens of patients or experimental 
animals. To make testing faster and at the same lime more 
sensitive, a dot-blot screening procedure for mutated rux 
genes has been devised on the basis of in vitro amplification 
with primers flanking the suspected mutation sites ami allele 
. (or mutation)-spccific oligonucleotides (Vcrlaan-dc Vrics et 
al. 1986). The diagnostic value of the new technique has been 
confirmed in numerous reports from different laboratories 
(Bos ct at. 1987; Kozma el at. 1987; McMahon et al. 1987; 
Jansscn et ah I987a,b; Farr ct al. J988; van't Veer et :il I9K8" 
(.yonsctnl. 1988). \ 

Oncogenes, including ras, with structural change* in their 
activated state tend themselves to this type of analysis since 
DNA sequence alterations arc easily detected. To monitor a 
^?5* C |ranKri P<< onB r activity of an otherwise nun- 

routed oncogene may not be as simple, but will probably 
; tjW^ 0 P°f*'Me. Thus, it may be expected that the amplificn- 
*W^tcehniqucfwilI facilitate tumor typing and tumor progrcs- 
.l£ _?*4Sj : analyses significantly. This includes the possibility of 
'F*^ 10 ""^ success or failure of tumor therapy at the molccu- 

• ^ J«^vel, including the dcteaion of drug resistance in tumors 
j (Layion ct al. 1988; Kashanisabct ct al. 1988). 
.:^-*V3P> e rcR has a,so been usc <* for the diagnosis of chronic 

V:?/- myeloid and acute lymphocytic leukemias that result from a 
J cnrotT, osomal translocation ^(Kawasaki et al. 1988). 

The^fusion leads to the expression of a leukemia-specific 
^>jS£ric mRNA, which combines information of the ABL 
profo-oncogene on chromosome 9 with the "breakpoint clus- 
te^lfcl^^ion* , gene (BCR) on chromosome 22. Visualization of 
'thkptbmiHUttl mRNA relies oji oligonucleotide punters 
located on either side of the junction and on diagnostic 
oligbnucleotides comprising breakpoint/junction sequences. 
The procedure is very sensitive: i pg total cytoplasmic RNA 
from a leukemia cell line is sufficient for the generation of a 
tumor-specific DNA fragment by amplification. Since only 
processed RNA sequences contribute to the appearance of 
this fragment, genomic DNA docs not interfere with the anal- 
ysis 

The detection of cluxunosomal translocations can also be 
achieved by amplifying genomic DNA regions flanking the 
crossover sites. This has been shown for the translocation 
which is characteristic for follicular lymphomas (Lee 
et al. 1987; Crcszcnzi ct al. 1988). The high sensitivity of the 
PCR affords detection of DNA from 1 in 10* cells, permitting 
a diagnosis under conditions that make the application of mor- 
phological, cytogenetic or even molecular analysis (Southern 
blots) difficult or impossible. 

Pathogen detection is the second clinical topic that already 

* i benefits from the new methods, most notably in cases where 

conventional techniques arc not sensitive enough or too cum- 
bcnouK. One such case is the human papilloma virus (HPVV 

(Howley 1987). To facilitate and cxpana HP\ cyping y-tii 
different types are known), the PCR has been adopted for the 



identification of the virus even in paraffin-embedded tissue 
(Shibata ct al. 1988). Furthermore, human retroviruses can 
readily be recognized by amplification. The human T-cc|| | ym . 
phoma virus type I (HTLV-I) (Bhagavati ct al. 1988; Duggan 
ct al. 1988; Kwok ct al. 1988) and the human immunodeficien- 
cy virus (HIV) (Kwok ct al. 1987; Farzadcgan et al. 1988- 
Murakawa et al. 1988; Byrne ct al. 1988) were reliably detect- 
ed by the new methods. Since pro viral DNA sequences (Ou et 
al. 1988) and viral RNA sequences (Byrne et al. 1988; 
Murakawa ct al. 1988) can be amplified separately, it may be 
possible to distinguish between latent and proliferative stages 
of infection and, hence, to monitor disease progression. 

Another clinically important virus is the hepatitis B virus, 
which has been detected in serum samples by in vitro amplifi- 
cation (Larzul ct ai. 1988). 

An important application of the PCR is concerned with the 
analysis of genetic polymorphisms involved in disease suscep- 
tibilities. Endeavors to understand the mechanisms of com- 
plex polygenic human diseases and to recognize them while 
still in a preclinical state have been concentrated on the iden- 
tification of DNA and protein markers associated with auto- 
immune disorders or hypertension and coronary heart dis- 
eases in families and populations. The specific advantage of 
the PCR is (as in prenatal diagnosis or carrier status deter- 
minations) the greatly facilitated assessment of haplotype se- 
quences known or suspected to be involved in the expression 
of a disease phenotype. These sequences may serve as mark- 
ers indicating that a risk of disease exists or, alternatively, 
they may be important because they contribute directly to the 
pathogenic mechanisms. 

One group of genes studies in this context contains the 
HLA class II genes (for review see Kaufman ct al. 1984). They 
code for dimeric cell-surface glycoproteins, which with a few 
exceptions arc highly polymorphic. They are normally expres- 
sed on the surface of B cells and interact with Tcell receptors 
and antigens to activate T cells and immune responses to anti- 
gens. The feasibility of a PCR-bascd approach to study specif- 
ic HLA class It hapiotypes of patients and of control probands 
by amplifying and direct sequencing has so far been demon- 
strated in three reports. They were concerned with the suscep- 
tibility to insulin-dependent diabetes mellitus (Todd et al. 
1987; Gu et al. 1988) and the dermatologic disorder pemphigus 
vulgaris (Scharf ct al. 1988). In both analyses, it was found 
that the disease susceptibility is largely dependent on the iden- 
tity of the amino acid residue at position 57 of the DQp allele 
of the HLA class II gene cluster. Different routes for the anal- 
ysis were chosen in these studies: the amplified template se- 
quences were cither mRNA (Todd et al. 1987) or genomic 
DNA (Scharf ct al. 198$; Gu et al. 1968). 

Other disease-related polymorphisms can also be studied 
by this method. Thus, the genotype of proteins involved in the 
cholesterol metabolism defines different degrees of risks, 
among them that of premature atherosclerosis and coronary 
heart disease (Berg 1986; Weisgraber et al. 1988). It can be 
expected that by screening large groups of probands preclini- 
cal risk assessment based on PCR-mediated genomic analysis 
will be a real possibility in the future, at least for some com- 
mon diseases. 

Applications in basic molecular biology 

In aaaiuai. t; uu ptrrvniimr. in^m"^: ^^ c - 

ical research, the PCR selves as a tool to facilitate complex 



proiocols in basic molecular biology. Small- and large-scale 
sequencing of DNA, in vitro symhesis of genes, site-directed 
recombination in vivo, the modification of DNA sequences in 
vitro or the rapid preparation of DNA probes, gene cxprcs- 
sion studies and other related activities profit from the power 
and improved sensitivity of this method. Only a few analytical 
and preparative applications will be considered here. Many 
more are conceivable or have already been reported 

Obviously, the PGR can be coupled with DNA sequencing 
protocols. In numerous reports, the feasibility of direct 
sequencing of amplified DNA has been demonstrated (Wong 
et al. 1987; Lcvinson et al. 1987; Wrischnik et al. 1987- Todd 
ct al. 1987; Engelkc et al. 1988; Ostc 1988; Cariello'ct al. 
,1988; Scharf et al. 1988). The relevant message emanating 
from these experiments is that minute quantities of DNA or 
RNA arc sufficient for sequencing and that cloning is not re- 
quired. In most cases, the dideoxy chain termination method 
was applied (Sanger et al. 1978). Alternatively, the chemical 
cleavage method (Maxam and Gilbert 1980) could be used 
provided one of the amplification pimcrs carries a 5' radioac- 
tive label. • * * 

A substantial improvement of the sequencing protocols is 
inc recently introduced -asymmetric- PCR, which leads to the 
gcnerajpwi of single stranded DNA by 'using unequal stoichio- 
metncggimounts of the two amplification primers (e.g., 
50 puffer one primer and 0.5pmoJ for the second primer- 
see Gyjfensten and Erlich 1988; Innis et al. 1988). The asym- 
inetncgr* action proceeds conceptually in two steps, starting 
with angular exponential amplification of double-stranded 
DNA Jg long as the limiting primer is -available. Once this 
prinwHfos been used up, the excess primer initiates synthesis 
of wngi^strands in a linear pogression. This DNA can be * 
sequenced directly with the Tag polymerase (Innis et al. 
1988) orjwith the modified 17 DNA polymerase (Scqucnasc) 
(GyJle^sien and Erlich 1988). Sequencing of single strands 
•voids gifficultics which often result from rapid reanncaling of 
complementary strands or from adventitious homology of 
tequengfg primer regions pesent on both strands. 

It h galso been suggested that amplification of DNA and 
RNA molecules be combined with in vitro transcription by 
•ppending a promotor sequence to the amplified DNA. This 
nwdification offers the opportunity of making single-stranded 
RNA molecules in vitro in large quantities for subsequent di- 
deoxy sequencing with reverse transcriptase (Stoflct et al 
1988; Sarkar and Sommer 1988), 

A novel procedure for DNA sequencing that has been con- 
nected with amplification is based on the incorporation of 
ocoxynuclcotide analogues carrying a phosphorothioate sub- 
stitution in the a-position (abbreviated dNTPaS). DNA 
molecules containing these analogues are specifically cleaved 
at the phosphorothioate positions in the presence of 2-iodo- 
cthanol t or 23-cpoxy-l-propanol (Gish and Eckstein 1988; 
Nakatnayc et al. 1988). Modified DNA is synthesized in vitro 
by PCR in four separate polymerization reactions (each with 
a different dNTPaS). Radioactive [»P] label is introduced in a 
«rand-speafic manner, e.g., by amplifying with one un- 
labeled and one 5MabeIcd primer. Since both synthesis and 
?anial cleavage arc fast and easily achieved, the combination 
- amplification with this new sequencing protocol may signif- 
icantly enhance the rate of DNA sequence determination. 

Most PCR sequencing experiments performed so far have 
been based on known sequences flanking the region of intcr- 
**. This type of analysis is mainly targeted at point mutations 
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Hf.7. A PCR-based strategy for the determination of completely un- 
known DNA sequences. Prerequisites are DNA fragment! (400-600 
bp long) with nonidentical recessed ends (restriction enzyme cleav- 
Jigc sites). Doublc-strandcd oligonucleotide adaptors (A and B) are 
ligated to the fragments. After removal of unused adaptor molecules, 
the fragments are amplified using single-stranded adaptor oligo^ 
nucleotides as primers (primer A' and B'). The subsequent separate 
sequencing of both strands of the PCR product is achieved usina 
cither primer A * or primer S' 



or other small sequence alterations, the closing of gaps (about 
600 bp or more) in connection with the sequencing of long 
genes, or the sequencing of unknown introns between exons 
derived from cDNA. (If sufficient amounts of genomic DNA 
are available, conventional "walking primer" protocols with- 
out amplification could, of course, also be used for the latter 
purposes.) 

The PCR procedure can be applied for the amplification of 
sequences that lie outside the boundaries of known regions. 
One approach, designated "inverted- PCR, is based on inver- 
sion of the sequence of interest by in vitro circularization and 
reopening at a different site within the known region. The 
cleavage results in two known sequences flanking an unknown 
region. Using the known sequences as anchor for amplifica- 
tion, sequence determinations may be extended to previously 
undetermined regions (Triglia et al. 1988). 

Another, somewhat related approach for amplifying (and 
sequencing) entirely unknown regions involves ligation of 
adaptor oligonucleotides to the ends of DNA restriction frag- 
ments (isolated, e.g., from cosmids; for the rationale see 
Fig. 7). The adaptors serve as annealing sites for amplification 
primers and subsequently also for priming sequencing reac- 
tions. To allow sequencing of both strands of one amplified 
DNA molecule, the 5' and 3' adaptors have to be different. 
Therefore, DNA fragments framed by non-identical staggered 
restriction sites arc preferable. DNA quantities in the nano- 
gram range will suffice to initiate amplification and sequence 
determination (M. Pfordt, unpublished results). 

The investigation of gene expression by monitoring the 
presence, appearance or disappearance of mRNAs in a few 
cells will presumably become a major application of the PCR. 
Some of the conventional methods (Northern, Si nuclease 
protection mapping) that monitor gene activities on the level 
of transcription may soon be replaced, at least to some extent, 
by the more sensitive new technique. The PCR-bascd trans- 
cript identification has appropriately been designated as 
M mRNA pbenotyping" (Rappolec et al. 1988). This will even- 
tually include the analysis of complex splicing patterns. 

A particularly striking demonstration of the potential to 
unravel new details about transcription and RNA processing 



i 



by PCR was the identification of the translations! stop codon 
in (he mRNA that codes for the apolipoprotein B48 in intesti- 
nal cells. This stop codon is not found in the respective gene. 
It has been suggested that if Results -from a tissue-specific co* 
or post -transcriptional modification of the primary transcript 
(Powell eta!. 1987). 

The verification of gene targeting may be mentioned as an 
additional interesting research application of PCR. The de- 
monstration of homologous recombination between DNA in- 
troduced into recipient cells and endogenous genes requires 
either that a selectable phenotype is induced by the event or 
that a powerful screening procedure is available (sec Kim and 
Smithies 1988, and references cited therein). The few success- 
ful targeting experiments reported so far were mostly based 
on phenotype selection. Screening was also used* but required 
laborious and time-consuming manipulations (Smithies ct al. 
1985). With the availability of the PCR. detection" of recom- 
binants is greatly facilitated even if homologous recombina- 
tion is rare, as is normally the case. Detection is based on the 
demonstration of amplified D^A of predicted -length, which 
can only bo obtained from recombinants and not from non-re- 
combinants or non-homologous recombinants. Oligonucleo- 
tides selected from regions flanking insertion sites are used in 
a manner similar to that employed in the amplification of 
chromosomal translocation breakpoint/junction regions. It 
was Shown in a test system with a previously established 
HPfSF gene-modified recombinant cell that single cells arc 
amenable to recombination analysis by PCR. Since cell mix- 
turcsjfan easily be investigated, sir>sctcction protocols were 
app®blc for the identification of rare events in large num- 

* benMf potential target cells (Kim and Smithies 1988). 

Regarding preparative goals, an important aspect of tha . 
PCfUs the possibility of combining amplification with directed 

* modifications of target sequences, notably at the ends of 
regigns of interest. Primers carrying additional sequences not 
present in the original template (promotors, restriction sites. 

4 othef=f ecognition sequences) at their 5' ends arc unaffected in 
-the inability to initiate a strand extension reaction. From the 

* scconfl cycle on. the additional sequences arc convened into a 
doubjjf *stranded form and participate in further amplification 
u irthey were part of the original target region. Thus, simply 
by the mechanism of the PCR, DNA sequences may be 
altered, in this case by adding new sequences without the need 
of separate synthetic or enzymatic manipulations. 

By analogy, the polymerase chain reaction has also been 
used for the introduction of base-specific mutations into 
amplified DNA. Such modifications require primers, which 
mismatch with a target gene segment. The mismatch is con- 
verted into a point mutation (or a small insertion or deletion) 
by the repeated cycles of DNA synthesis (Rochlitzet at. 1988; 
Higuchi et al. 19&8b). These mutations are located at the ends 
of amplified DNA, but they can be relocated into the middle 
of a larger segment by the PCR mediated in vitro -recombina- 
tion** of two overlapping fragments that carry the same point 
mutation within the region which is common to both frag- 
ments (for details see Higuchi et al. 1988b). 

DNA amplification has also been used for the synthesis of 
DNA molecules several hundred base-pairs in length (Mullis 
et al. 1986). Overlapping oligonucleotides (74 bases in the 
reported example) were mutually extended at their respective 
5' ends in the initial round of amplification. The resulting 
double-stranded DNA molecules were then expanded further 
by additional amplifications using new oligonucleotides. 
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Fig. 8. Appending overlapping DNA sequences to each other. DNA 
molecules that overtop in a hc:id*to-tail fashion at their respective 3' 
and 5* ends (e.g.. partial cDNAs from the same mRNA) can be com- 
bined by denaturalion. annealing and strand filling by a DNA poly- 
merase. Amplification of such product to a preparative level is achieved 
using the primers A and B. Only molecules combined by unncaling 
and completed by strand filling can be amplified. Note that of the two 
possible overlap annealing combinations (designated / and //) only 
that with overlapping 3* ends (combination //) can be processed 
further 



which overlapped with the 3' ends of products made in the 
preceding reactions. The main use of this protocol may be not 
so much the in vitro synthesis of long DNA molecules (genes). , 
but the rapid reconstruction of full-length cDNAs (e.g.. for 
expression cloning) when only overlapping fragments arc 
available (Fig. 8) or, alternatively, the rapid construction of 
chimeric genes. 

In a related approach, selected subregions of regulatory or 
other genomic regions can be accurately isolated and. at the 
same time, modified by amplification, independent of appro- 
priate restriction sites and without recourse to time-consum- 
ing processing and subcloning of DNA. Amplification not 
only provides sufficient quantities of target DNA for cloning, 
•but affords, as mentioned above, the introduction of restric- 
tion sites at the ends of the fragments, allowing rapid insertion 
of DNA molecules into different genetic environments (e.g., 

expression vectors). 

The generation of cDNA with mRNA as the initial 
template is straightforward if primer sequences can be derived 
from homologous genomic sequences or from a closely related 
heterologous sequence of a different species. If appropriate 
information regarding the DNA sequence is not available, 
known protein sequences may guide the synthesis of (degener- 
ate) primers required for amplification. The feasibility of the 
protein approach has been demonstrated by Lee et al. (1988), 
who amplified a partial cDNA probe (112 bp) specific for the 
porcine enzyme urate oxidase using liver mRNA as a tem- 
plate. This enzyme does not exist in humans. The amplified 
fragment was used for the isolation of the complete porcine 
urate oxidase cDNA. This DNA served subsequently as a 
probe for the identification of regions in the human genome 
having homology with the porcine cDNA. Preliminary data 



suggest that sequences related to this non-essential gene exist 
iri the human genome, possibly in the form of a functionally 
inactive single-copy pseudogene. In addition to the signifi- 
cance that this observation may have for the analysis of human 
gene evolution, these authors have convincingly demonstrated 
the usefulness of the PCR for the rapid cloning of cON A sole- 
ly based on partial protein information. (A somewhat unex- 
pected result of this experiment was the observation that mis- 
matching of primer sequences docs not necessarily preclude 
specific amplification; this is also of practical relevance.) 

Working with degenerate primers can be facilitated by in- 
cluding inosine in the primer sequences at positions with a 
high degree of codon degeneration. Since inosine is to a cer- 
tain extent neutral with respect to base pairing, use of this 
base improves significantly the chance that primers recognize 
target sequences that arc only partially known (Knolh ct al. 
1988). 

Another approach to clone cDNA can be applied if knowl- 
edge of a single short stretch of sequence within the mRNA is 
available. In essence, cDNAs arc generated by separately 
amplifying two regions of the dDNA, one reaching from the 
known sequence to the 5' end ahdthc other one extending to 
the 3' end, respectively. For both amplifications two primers 
are needed. One is taken from the known region within the 
mRfijA, and 'he second one is complementary cither to the 
natufjally occurring poly(A) tail of the mRNA at the 3' end or 
to argo)y(dA) tail, which is artificially attched to the 3* end of 
thcffjrst cDN A Strand mode in vitro (for details sec Frohman 
et ajj 1988). This protocol may be particularly useful if the 3' 
encfof a mRNA has been cloned, but the 5' end is missing. In 
thelp[ cases the completion of a partial cDN A to a full-length 
cDNf\ can be obtained with little effort. : 

^Fhe relative ease with which DNAs can be obtained on a** 
. preparative scale by amplification should not conceal the 
caveat that exists regarding sequence fidelity (see the discus- 
lioVpf the technique of the PCR). Amplified and subsequent- 
ly B?ned DNA molecules require additional sequence verifi- 
cation, in particular if the DNA is intended to be used in bio- 
Jo^jeaJ studies or for protein production and protein engincer- 
. ingpn a host system (see, e.g., Takeshita et al. 1988). 

TtnalJy, it should be noted that an intricate and very effi- 
cient scheme for RNA-dependent amplification of RN A mol- 
ecules has recently been reported as an alternative to primer- 
dependent DNA amplification (Lizardi ct al. 1988). In this 
reaction, Q0 replicase was used as the catalyst. The templates 
were recombinant transcripts made in vitro: cRNAs synthe- 
sized by T7 RN A polymerase on appropriate DNA templates. 
These transcripts contained at their 3' ends recognition se- 
quences required for Qp RNA replication (for review, see 
Bicbricher 1983). Neither oligonucleotide priming nor cycling 
between different temperatures was needed. Since both RNA 
template and product strands are accepted for the initiation of 
a new round of replication, the increase in the amount of pro- 
duct is (as for the DNA-dcpendent PCR) exponential. The 
reaction is restricted to (engineered) RNA molecules that in- 
clude Qp-specific signals. This type of amplification may be a 
powerful method for the generation of RNA probes, which at 
the same time can be used as amplifiable "reporter'* molecules 
(i.e., they could indicate the presence of target DNA se- 
quences in membrane-immobilized heterogeneous mixtures). 
Whether this procedure is suitable as a general method for the 
amplification of RNA is as yet not known. The answer de- 
pends critically on the effects that non-Qp-RNA sequences 



covalently linked to Qp-rcplication signals have on the activity 
of OP replicase. 



Conclusions and perspectives 

The PCR with its enormous increase in the sensitivity of the 
analysis of small amounts of DNA or RNA docs not replace 
existing molecular methods, but rather adds, by its superior 
resolution properties, to the potentials of established or newly 
developed procedures (e.g., Myers and Maniatis 1986; Church 
and Kieffcr-Higgins 1988; Landcgren et al. 1988). 

The concept of the PCR is theoretically straightforward 
and many successful experiments have been reported, but it is 
not fully established with respect to ail of its aspects. Limits in 
its applicability exist at the moment regarding target length 
and sequence fidelity. The latter complication may turn out to 
be a somewhat serious drawback of the method unless condi- 
tions can be defined in which the rate of misincorporations can 
be substantially reduced. In exceptional cases, secondary 
structure of template molecules (e.g., inverted repeats) may 
block synthesis to an extent which constrains amplification. 
Although it has been reported that extensive purification of 
crude nucleic acid extracts is not necessary, this may not al- 
ways be the case. If perfect matching of primers is not re- 
quired for an amplification to be specific, competing reactions 
in crude mixtures may complicate the results. It is obvious; 
that, because some experimental standardization is still lack-* 
ing, PCR is not yet a routine procedure for diagnostic pur- 
poses. In addition, the reaction is rather expensive for the 
daily use. 

Nevertheless, it is already quite clear from numerous pub- 
lished results that most conditions (incubation times, concen- 
trations of reaction components, temperatures for annealing 
and synthesis, lengths and base compositions of primers, etc.) 
can be tailored to suit very different analytical and preparative 
purposes. Thus, the PCR will become an advanced and fairly 
general technique for the study of genes and gene activities. 

The typical diagnostic application in genetics will be the 
DNA or RNA sequence, which is altered by mutation, poly- 
morphic variation, deletion, translocation, recombination or 
related processes. In addition, extensive gene dosage varia- 
tions caused, e.g., by repeated duplication of genes in the 
genome (this process is also called amplification), may easily 
be recognized. Loss or gain of whole chromosomes (mono- or 
trisomies) and major rearrangements are probably more reli- 
ably recognized by recently developed cytogenetic techniques 
(designated in situ suppression hybridization), which allow the 
visualization of chromosomes and subchromosomal regions 
with a very high resolution even in interphase nuclei (Cremer 
ct al. 1988; Lichter et al. 1988). 

The importance of PCR for the progress of genetic re- 
search will in the long run be significant in more than one 
respect. A few points regarding possible developments may be 
emphasized. 

1. With regard to molecular biology, amplification will contri- 
bute to the rate of DNA sequence acquisition. Furthermore, 
gene activity studies will benefit from the analysis of very few 
cells and ultimately single cells. Finally, gene targeting may 
gain ground with the availability of a sensitive and relatively 
fast method for the monitoring of recombinational success in 
cases not amenable to selection. It is even conceivable that, in 



- >K - conjuction wiih elaborate microinjection techniques (Ansorgc 
* and Pepperkok 1988), new gene replacement strategies can be 
designed. * -i 

2. The analysis of single ^pcrm offers the chance of construct- 
ing an improved recombination map of the human genome. 
PCR-derived data will eventually complement information 
that is currently being collected by other powerful molecular 
techniques, such as pulse-field gradient gel electrophoresis 

- (PFGE) (Schwartz and Cantor 1984) and chromosomal jump- 
ing (Poustka et al. 1987)._ 

3. Hie intensity of diagnostic activities in genetic research will 
generally increase. It is quite obvious that PCR will guide the 
mutation analysis of identified disease genes by allowing 
direct sequencing of DNA. Since amplification of genomic 
DNA docs not depend on the presence of restriction sites, the 
term "RFLP~ can be considered as describing a special case of 
the general phenomenon "DNA sequence polymorphism** 
(DSP, D. W. Yandcll, personal communication). Screening, 
at least for certain genetic traits, may become feasible in the 
general population in addition to families at risk. Advanced 
knowledge will become available about the causes and patho- 
genic mechanisms of multifactorial disorders or, more specifi- 
cally, about the contributions of genetic polymorphisms to 
common diseases. The concept of risk prediction may come 
yahe step closer to its realization. (Considering the potential of 
D§CR, we may approach the point where we have to decide 
fttbw much prediction we want.) 

jj! As a consequence of this development, a tight link will 
Mfevclop between human genetic research (including clinical 
) Nineties) and basic molecular biology. Progress in human 
- • ? cnctia '* Iar 8 ci Y based on the observation of random events 
*»-Qp non-sclectively breeding populations rather than on dc- 
.ij^^iiberite experimentation. The possibility of extending the 
. ^V-: S? n * c of observations to cells, molecules and genes of indi- 
^;V^vJ! dua, I* 1 * 005 and °f comparing them with each other will sig- 
V^r* '^fljficantly advance our concepts of human biology and genetics 
/uT'l^fsfi 8 * thc rcccnl discussion by While and Caskey (1988) with 
..£.*"*; ^t?8 ard lo *he human as an experimental system in molecular 

. * "This list is certainly incomplete, and one might argue as to 
where the main emphasis should be placed. Additional rele- 
vant applications in both medical science and molecular bio- 
logy are likely; these will eventually confirm the PCR as a 
major technical breakthrough. 
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